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A BASIS FOR SLICING BIRKHOFF POLYTOPES 


TREVOR GLYNN 


Abstract. We present a change of basis that may allow more efficient calculation of 
the volumes of Birkhoff polytopes using a slicing method. We construct the basis from a 
special set of square matrices. We explain how to construct this basis easily for any Birkhoff 
polytope, and give examples of its use. We also discuss possible directions for future work. 


1. Introduction 

The nth Birkhoff polytope, which we denote B^, is the convex hull of the set of nonnegative 
n X n matrices whose rows and columns sum to 1. Much has been written about these 
polytopes and their properties. (See [1], [4], and [5], for example.) Of these properties, the 
volume is of special interest. The exact volume of Bn is known only for n up to 10. Beck and 
Pixton gave the exact volume of Biq in [1]; the calculation took over a year using dozens of 
computers. In [3], Canfield and McKay gave an asymptotic formula for the volume of Bn- 
De Loera et. al. [7] found an exact formula for the volume of Bn, but its calculation remains 
intractable for large values of n. 

Results from Liu [6] offer a new algorithm to calculate the volume of a polytope by slicing. 
The method requires the polytope to satisfy special integrality conditions. To describe these 
conditions, it is necessary to introduce some basic definitions. The following discussion is a 
simplified summary of the material covered in [6]. For the sake of simplicity, we will use the 
standard basis instead of an arbitrary one. 

We concern ourselves with the real vector space and the lattice Z'^. First, we define 
two functions: Let —)■ be the function that maps the vector [xi, X 2 -i - ■ ■, Xd) to 

the vector (ti,X 2 , • • • ,Xk)- Let tt^: ^ be the function that maps the vector y to 

its inverse image under In effect, one can think of ’Xkiy) as shifting the entire affine 
space by the vector y. We will call the affine space a slicing space, for reasons 

that will become clear shortly. 

Now we are ready to introduce the special integrality conditions. We say that an affine 
space U C is integral if 

7r(dimC([/nZ^) = Zd™^. 

Likewise, we say a polytope is affinely integral if its affine hull is integral. Finally, we say 
that a polytope is k-integral if all of its faces of dimension less than or equal to k are affinely 
integral. 

The condition of fc-integrality is very strong and difficult to satisfy for high values of k. 
Fortunately, there is a weaker condition that will suffice for our needs. We say that an affine 
space U C is in general position if 

7r(dimC(^) = MdimC/_ 
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We define what it means for a polytope is in k-general position in the predictable way. A 
polytope is in ajfinely general position if its afhne hull is in general position. A polytope is 
in fc-general position if is faces of dimension less than or equal to k are in affinely general 
position. 

It is a theorem of [6] that for all d-dimensional polytopes P C the following holds: If 
P is (k — l)-integral and in fc-general position, then the normalized volume of P, which we 
will write NVol(P), is 

(1.1) NVol(P)= J^Vob_fc(7rfc(y)nP), 

j;g7r('=)(P)nZ*’ 

where VolD-kiQ) is the volume of Q with respect to the lattice 
Algorithmically, one can imagine the formula in this way: 

(i) Project P into 

(ii) Collect the integral points of the projection. 

(iii) Derive the slicing spaces from the integral points, and intersect them with the original 
polytope P. This step creates “slices” of P, which will have dimension D — k or less. 

(iv) Sum the volumes of the slices. 

Example 1. In the case where k = 1, the slicing spaces are hyperplanes. Figure 1 depicts 
an example of this case. This is Example 2.1 in [6]. 

Let P be the polytope in Figure 1. We can apply formula (1.1) to P as follows: 

Vol (tti (y) n P) = Vol (tti (y) n P) 

j?e7r(i)(P)nz j?e{o,i,2,3,4} 

=0+l+4+3+0 
= 8 

Note that the 1-dimensional polytopes {(0,0,0)} and {(4,0,0)} are still considered slices, 
although they contribute nothing to the total volume. 

Notice how none of the edges of the polytope in Figure 1 lie parallel to the slices. This is 
an important observation; it demonstrates a general rule that is worth emphasizing. 

Lemma 2. A polytope is in 1-general position if and only if none of its edges are parallel 
with one (equiv. any) of the slieing spaees. 

Proof. This follows from the definition of the fc-general position property. □ 

Formula (1.1) only works on polytopes that are {k — l)-integral and in /c-general position. 
Nevertheless, it is possible to apply the algorithm to any rational polytope by performing a 
special change of basis and then scaling the coordinates. 

We plan to apply formula (1.1) to Birkhoff polytopes with k = 1. It is our hope that this 
may enable the derivation of a more tractable formula for the volume of Bn- To do this, we 
need to ensure the Birkhoff polytopes are 0-integral and in 1-general position. Fortunately, the 
property of being 0-integral is equivalent to being integral. Therefore one of the prerequisites 
is already satisfied. 

However, it turns out that none of the Birkhoff polytopes beyond B 2 are in 1-general 
position. In this paper we devise a change of basis that will put any Birkhoff polytope in 
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(0,0,0) (4,0,0) 

Figure 1 . A polytope that is 1-integral, with its slices drawn and shaded. 


1-general position. Using this change of basis, it will be possible to apply formula (1.1) to 
any Birkhoff polytope, with k = 1. 

Before we explain how we find this change of basis, it is worthwhile to explain why most of 
the Birkhoff polytopes are not in 1-general position. It is known that the direction vectors of 
the edges of the nth Birkhoff polytope comprise a special set of (—1,0, l)-matrices, which we 
will call M-n- These matrices are discussed in the work of Brualdi and Gibson [2]. Here we 
give only a summary. 

The matrices in correspond with directed simple cycles in the complete bipartite 
graph with elements ui, 112 , • • •, and Ui, U 2 , ..., The correspondence is as follows: 
A value of 1 for element M[i,j] corresponds with an edge from Ui to Uj, and a value of —1 
corresponds with an edge to Ui from Vj. In particular, if n > 2, then we can form a simple 
cycle in Kn^n that includes neither ui nor vi. Such a cycle will correspond with an edge in 
that is not in affinely general position. Hence is not in 1-general position for all n > 2. 

2. Main Result 

From Lemma 2, we know that we need to change to a basis such that none of the slicing 
spaces (in this case, hyperplanes) of Bn are parallel to any edges. We will work backwards— 
first identifying a set of hyperplanes by their shared normal vector, and then deriving the 
basis from this set. 

Definition 3. For all integers n > 1, let V„ = ( 0 ^^) be the square matrix 
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and define Vi = 1. 

We will prove the following theorem: 

Theorem 4. For all integers n > 0 and edges e of Bn, the inner product V„ • dir e is nonzero. 

Thus we will show that the matrix is not orthogonal to the direction vector dir e of any 
edge e of the nth Birkhoff polytope. From this fact, we will conclude that any hyperplane 
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normal to V„ satisfies the condition in Lemma 2. Before we prove this result, we will need to 
make several dehnitions. 

First of all, since the matrices M represent simple cycles in Kn^m there can never be more 
than one 1 or —1 in any row or column of M. Furthermore, if a 1 (or —1) occurs in any 
particular row or column, then there must also be a —1 (resp. 1) in the same row or column. 
These observations will be useful later. 

Our goal is to show that the dot product of V„ with any matrix M G Mn is never zero. We 
can visualize the product • M as a special cycle consisting of elements in the matrix V„, 
where we consider two entries adjacent if they share a row or column. Since the matrix M 
contains negative entries, we must assign a sign to alternating elements in the cycle. This 
way, we do not need to bother with the set M. at all, and can think just in terms of the 
matrix V„. 

Definition 5. A Birkhoff cycle is a sequence of entries aij, aik, ajk, ■ ■ ■, aij of Vn that 
correspond with the entries in a matrix M in A4n- We give these elements a sign according 
to the sign of the corresponding entry in M. When we speak of the sign of an element in 
a Birkhoff cycle, it is this sign we are referring to, not the sign of the entry itself. (The 
vector V„ is nonnegative, so there should be no ambiguity.) 



Figure 2. Two Birkhoff cycles in the matrix V 4 . The shaded elements are 
negative, and the unshaded ones are positive. 

When we think of Birkhoff cycles, it can be useful to remember that their elements come 
from matrices. We may want to say that one element is “left of” or “above” another. When 
we say these things, we are referring to the elements’ relative positions as matrix entries. 

Definition 6. We say that an element of a Birkhoff cycle in is the maximal element 
if, for all other cycle elements aij, we have either r > i, or r = i and s > j. Informally, the 
maximal element is the bottom and rightmost element. 

Example 7. In the cycle in Figure 2a, the maximal element is 48. In the cycle in Figure 2b, 
the maximal element is 32. 

Giving a sign to all of the elements in a Birkhoff cycle can be done in two ways, and it will 
be helpful for us to distinguish between the two. 
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Definition 8. If the maximal element is positive, we will say that the cycle itself is positive. 
Likewise, if the maximal element is negative, the cycle is negative. 

Example 9. Figure 2a is a positive cycle, and Figure 2b is a negative cycle. 

Since Birkhoff cycles are intended to represent the dot product V„ • M, which is a scalar 
quantity, there should be a natural way of expressing that scalar as a property of the cycle 
itself. The following dehnition satishes this requirement. 

Definition 10. The sum of a Birkhoff cycle is the sum of its elements, signs included. 

Example 11. The sum of the cycle in Figure 2a is 30, and that of the cycle in Figure 2b 
is —20. 

Notation. If c is a Birkhoff cycle, we will use ^ c to denote its sum. 

Notice that, in the previous example, the positive cycle has a positive sum, whereas the 
negative cycle has a negative sum. It turns out that this is not merely a coincidence, as we 
will prove now. 

Lemma 12. The sum of a positive Birkhoff cycle is positive. The sum of a negative Birkhoff 
cycle is negative. 

Proof. Since we can turn a negative cycle into a positive one by reversing all of the signs, we 
can restrict ourselves to the positive case. Let c be an arbitrary positive cycle whose maximal 
element is in row r. Recall that c must contain exactly zero or two nonzero elements from 
each row and column of V„. With this in mind, we know that the two cycle elements in row r 
contribute a total value of no less than to ^ c. This happens then the maximal element 
is immediately to the right of its (negative) neighbor. Since every row of V„ is an arithmetic 
sequence, this is true no matter which column contains the maximal element. 

Now we ask ourselves what the rows above r contribute. We can overestimate by forgetting 
the positive elements and just taking the sum of the negative ones. Let ii < i 2 < ... < is 
be the rows above r with elements that are part of the cycle. We know from the definition 
of A4 that each of these rows contains exactly one element that is in c and is negative. For 
each row ik, let jk be the column containing this element. The columns ji, j2, ..., js must 
be distinct because c cannot contain two negative elements from any column of V„. 

The sum of the negative elements above row r, not including their sign, is 

S 

( 2 . 1 ) 5 = 

fc=i 

In the right-hand side of equation (2.1), the terms that make up each summand are jk — 1 
and Since the row indices ik are in increasing order, we know the terms are also 

in increasing order. However, we know nothing about the order of the column indices jk. The 
value of S would be greatest if the column indices were also in increasing order. Therefore, if 
we let j[ < j2 < ' '' < j's b® tbe same column indices, then 

S S 

Uk - l)n*'=-' < {fk - 

k=l k=l 
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We can go even further by choosing the maximum possible values of ik and and maximizing 
their number. 

s r—2 r—3 

[j'j. — [n — r + k + [n — r + k + 2)n^ 

k—1 k~l k—0 

Now we can say that 


J]] c > - 5 

r-3 

> {n — r + k + 2)n^. 

fc =0 

When r = 3, we get the result X] ^ > 0. In this case, we can make the inequality strict by 
simply checking that the “worst case” value of S is unachievable unless there exists at least 
one positive cycle element above row r. 

When r > 3, we get 

r—3 

c > (n — r + fc + 2)n^ 

fc =0 

= — ((n — + (n — 2)n’’“‘^ + • • • + (n — r + 2)) 

= rf~‘^ — {rf~^ — — 2rf~^ + • • • + n — (r — 2)) 

= + 2n”“^ + • • • + (r — 4)n + (r — 2) 

> 0 . 


We conclude that c has a positive sum. Since we chose the cycle c arbitrarily, it follows that 
all positive Birkhoff cycles have positive sums. □ 

Now we can give the proof of Theorem 4. 

Proof. By applying Lemma 12, the proof of Theorem 4 is trivial: Every edge of corresponds 
with a matrix M G A4, and the dot product • M is the sum of the Birkhoff cycle, which 
we have just shown to be nonzero. □ 


3. Constructing the Basis 

To construct a new basis for the nth Birkhoff polytope, we begin by choosing vf — 1 linearly 
independent vectors from the hyperplane Hn whose normal vector is parallel to V„. Since 
is not orthogonal to any of the edge direction vectors of we know that is not parallel 
with any of the edges. Hence we can use this hyperplane to take slices of Bn- 

There are several obvious choices for the particular vectors that will comprise our basis. 
For instance, there exist 2n — 1 elementary (0, l)-matrices with a single nonzero element in 
the top row or leftmost column. Since this row and column are hlled with zeros in V„, clearly 
these elementary matrices he in Hn- 

For the rest of the basis, we take the (n — 1)^ — 1 vectors v that are zero everywhere except 
for n[2,2] = —aij and v[i,j] = 1, where i,j>l and i and j are not both 2. The hnal vector 
will be the elementary matrix with a one in the second row and second column. 

Example 13. The matrices in (3.1) form the basis for V 3 . 
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/l 0 0\ /o 1 0\ /o 0 l\ /o 0 0\ /o 0 o\ 

0 0 0 0 0 0 0 0 01 0 0 0 0 0 

Vo 0 oy Vo 0 oy Vo 0 oy Vo 0 oy Vi o oy 

/o 0 0\ /o 0 0\ /o 0 0\ /o 0 o\ 

0 -2 1 0 -3 0 0 -6 0 0 1 0 

Vo 0 oy Vo 1 oy Vo 0 y Vo 0 oy 

Earlier we mentioned the possible need for scaling the coordinates of the polytope after 
changing basis. But it turns out that this basis is unimodular. (This is easy to see by making 
a matrix where the nth row is formed by concatenating the rows of the nth basis vector.) 
Therefore, there is actually no need to scale the coordinates of the polytope; changing basis 
is all that is required. 

Using this change of basis on a Birkhoff polytope will put it in 1-general position, which 
will allow the use of the slicing method described in [ 6 ]. 

There is an elegant interpretation of this result. The first eight basis vectors in (3.1) span 

a hyperplane that is parallel to every slice of The ninth basis vector is the offset that 

separates the slices. 

After applying the change of basis to the Birkhoff polytopes and S 4 , the resulting 
polytopes have the vertex representations shown in Figure 3. 


B ,: {(1,0,0,1,0,0,1,0,0), (2,0,1,0,0,1,1,0,0), (3, 0, 0,1,1, 0, 0,1, 0), 

(5,1,0,0,0,1,0,1,0), ( 6 ,0,1,0,1,0,0,0,1), (7,1, 0, 0, 0, 0, 0, 0,1)} 

: {( 6 ,0,0,0,1,0,1,0,0,1,0,0,1,0,0,0), (7,0,0,1,0,0,0,1,0,1,0,0,1,0,0,0), 

(9,0,0,0,1,0,0,0,0,0,1,0,1,0,0,0), (11, 0,1, 0, 0, 0, 0,1, 0, 0,1, 0,1, 0, 0, 0), 
(13,0,0,1,0,0,0,0,0,0,0,1,1,0,0,0), (14,0,1,0,0,0,1,0,0,0,0,1,1,0,0,0), 
(18,0,0,0,1,0,1,0,1,0,0,0,0,1,0,0), (19,0,0,1,0,0,0,1,1,0,0,0,0,1,0,0), 
(24,0,0,0,1,1,0,0,0,0,1,0,0,1,0,0), (27,1,0,0,0,0,0,1,0,0,1,0,0,1,0,0), 
(28,0,0,1,0,1,0,0,0,0,0,1,0,1,0,0), (30,1,0,0,0,0,1,0,0,0,0,1,0,1,0,0), 
(33,0,0,0,1,0,0,0,1,0,0,0,0,0,1,0), (35,0,1,0,0,0,0,1,1,0,0,0,0,0,1,0), 
(36,0,0,0,1,1,0,0,0,1,0,0,0,0,1,0), (39,1,0,0,0,0,0,1,0,1,0,0,0,0,1,0), 
(44,0,1,0,0,1,0,0,0,0,0,1,0,0,1,0), (45,1,0,0,0,0,0,0,0,0,0,1,0,0,1,0), 
(49,0,0,1,0,0,0,0,1,0,0,0,0,0,0,1), (50,0,1,0,0,0,1,0,1,0,0,0,0,0,0,1), 
(52,0,0,1,0,1,0,0,0,1,0,0,0,0,0,1), (54,1,0,0,0,0,1,0,0,1,0,0,0,0,0,1), 
(56,0,1,0,0,1,0,0,0,0,1,0,0,0,0,1), (57,1,0,0,0,0,0,0,0,0,1,0,0,0,0,1)} 


Figure 3. The vertices of S 3 and S 4 after performing the change of basis 
to put them into 1 -general position. 


7 



4. Future Work 


Although our basis makes it possible to apply the slicing method to any Birkhoff polytope, 
it is unclear how best to do so. It is possible that the slicing method will yield an exact 
formula that is easier to compute, or that has interesting properties in itself. 

It may be possible to invent new matrices, similar to the matrices V„, which are not 
orthogonal to any higher-dimensional facets of i?„. With such matrices, one could derive a 
basis that permits lower-dimensional slices. 
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